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ABSTRACT:  A  review  of  iterated  transformation 
image  compression  is  presented.  Generalization  of  sim¬ 
ple  iterated  function  system  fractal  generating  algorithms 
to  an  automated  iterated  transformation  algorithm  used 
to  compress  greyscale  images  is  reviewed.  Compressed 
images  from  the  Digital  Terrain  Elevation  Database  are 
presented,  and  compared  with  encodings  using  adaptive 
discrete  cosine  transformations,  and  mean  residual  vector 
quantization  image  compression  techniques. 

1.  INTRODUCTION 

Because  of  the  increasing  use  of  digital  imagery,  there 
is  currently  considerable  interest  in  the  image  compres¬ 
sion  problem.  In  particulaur,  image  compression  is  a  cur¬ 
rent  and  growing  necessity  for  Navy  applications  includ¬ 
ing  storage  and  transmittal  of  maps,  intelligence  pho¬ 
tographs,  weather  information,  etc.  General  interest  in 
image  compression  has  led  to  the  establishment  by  the 
Joint  Photographic  Experts  Group  of  a  standard  based 
on  discrete  cosine  transforms  (ADCT).  There  is  also  an 
on  going  effort  in  the  research  community  to  design  im¬ 
proved  vector  quantization  (VQ)  methods,  and  to  develop 
methods  which  utilize  wavelet  transformations.  A  rela¬ 
tively  new  approach  to  the  image  compression  problem, 
iterated  transformations,  has  been  presented  by  Jacquin 
[1,2].  This  method  has  its  foundation  in  the  theory  of  iter¬ 
ated  function  systems  (IFSs),  developed  by  Hutchinson  [3] 
and  Barnsley  [4],  and  recurrent  iterated  function  systems 
[5].  The  iterated  transform  algorithm  has  received  partic¬ 
ular  interest  because  of  the  /racial  nature  of  the  encoded 
images,  and  because  there  has  been  much  speculation, 
but  little  information  available  on  the  capabilities  of  the 
method.  The  first  sections  of  this  paper  review  the  basic 
methodology  of  the  iterated  transform  image  compression 
technique.  This  is  followed  by  a  section  on  the  compres¬ 
sion  of  the  Digital  Terrain  Elevation  Database  (DTED) 
in  which  results  obtained  using  iterated  t'ansformations 
are  compared  to  ADCT  and  VQ  methods. 


2.  BACKGROUND:  SIMPLE  EXAMPLES 

This  example  serv<»  as  a  simple  illustration  of  wsme  con¬ 
cepts  involved  in  the  iterated  transform  image-encoding 
scheme.  This  example  is  based  on  iterated  function  .sys¬ 
tems.  The  main  concept  is  that  the  image  of  a  set  (a 
Sierpinski  gasket,  in  this  case)  can  be  reconstructed  from 
a  set  of  transformations  which  may  take  less  memory  to 
store  than  the  original  image. 

Consider  the  three  transformations  shown  in  figure  1. 
They  are 

and 
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For  any  set  S,  let 
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iy(5)=Uu;.(S). 

»=:l 

Denote  the  n-fold  composition  of  W  with  itself  as  kV®". 
Define  An  =  W{An-i)  =  kV”"(/lo)  and  arbitrarily  choose 
Ao  as  the  unit  square  with  lower  left  corner  at  the  origin 
(i.e.,  Ao  =  {(^.!/)|0  <*<l,0<y<l)).  Then  as 
n  — »  00,  the  set  An  converges  to  a  limit  set  Aoo  In  fact, 
for  any  compact  set  S  C  ,  kV*"(S)  — ♦  Aoo  as  n  — *  oo. 
Figure  2  shows  Ao,Ai,A2,  A3,  A^,  A5  and  Aoo- 

That  all  compact  initial  sets  converge  under  iteration 
to  Aoo  is  important — it  means  that  the  set  Aoo  is  defined 
by  the  u;,  only. 

Each  is  determined  by  6  real  values,  so  that  for  this 
example  18  floating  point  numbers  are  required.  In  single 
precision,  this  requires  72  bytes.  The  memory  required  to 
store  an  image  of  the  set  depends  on  the  resolution;  the 
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Figure  i.  Three  eifiBe  trunformations  in  the  plane. 


domain,  which  each  un  acta  on  muat  be  stored  as  part 
of  the  encoded  image.  Second,  the  transformations  have 
to  be  generalized  to  three  dimensions.  A  gray  acaie  im* 
age  can  be  thought  of  as  a  three-dimensional  image,  eacli 
pixel  having  an  r,  y  coordinate,  and  an  intensity  value  i. 
A  form  for  the  transformations  which  is  convenient  for 
encoding  gray  scale  images  is. 


X 

at  bi  0 

X 

’ 

li’l 

y 

z 

— 

Ci  di  0 

0  0  ti 

y 

z 

+ 

/( 

. 

Pigare  2.  A«,At,A],A3,Ai,Ai,  snd  A«o. 

Aco  image  requires  256  x  256  x  1  bit  =  8192  bytes  of 
memory.  The  resulting  compression  ratio  in  this  example 
U  113.8 : 1. 

It  is  inherently  difficult  to  find  an  IFS  which  will  encode 
an  arbitrary  set.  Furthermore,  in  this  example,  the  image 
of  the  Sierpinski  gasket  is  described  by  a  set  of  pixels,  each 
being  either  black  or  white.  The  problem  of  more  inter¬ 
est  for  image  compression  applications  is  the  encoding  of 
gray  scale  images  (i.e.,  an  image  in  which  each  pixel  has 
many  possible  gray  levels,  not  just  black  or  white).  There 
are  two  generalizations  to  the  simple  example  given  above 
which  make  encoding  gray  scale  images  feasible.  First,  in¬ 
stead  of  each  W{  operating  on  the  entire  image,  the  Wi  are 
restricted  to  operate  on  a  section  of  the  image.  The  the¬ 
ory  of  IFS’s  has  been  extended  by  Barnsley  and  Jacquin 
[5]  to  allow  transforms  to  operate  on  only  parts  of  the  set 
rather  than  the  entire  set,  in  a  method  they  call  recrr- 
rent  iterated  function  systems.  The  particular  section,  or 


Consider  the  sixteen  transformations. 
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where  the  first  eight  transformations  are  restricted  to  act 
on  the  region  {(x,y)lO  <  *  <  1/2,  0  <  y  <  1/2),  and  the 
second  eight  transformations  are  restricted  to  act  on  the 
region  {(*,  v)ll/2  <»<l,0<y<  1/2).  Similar  to  the 
example  given  above,  the  map  W  is  defined  as  the  union 
of  the  uz/s.  Let  values  of  r  =  0  be  represented  as  black, 
r  =  1  as  white,  with  intermediat  e  values  as  shades  of  gray. 
The  initial  image  Ag  is  ar^  .trarily  chosen  as  z  =  0.5  for 
{(i,y)|0  <  *  <  1,0  <  y  <  1).  The  first  six  iterates,  and 
the  fixed  point  are  shown  in  figure  3.  In  practice,  the 
values  of  z,y,  and  i  are  discretized.  When  the  image  in 
this  example  is  Jiscretized  as  128  x  128  pixels,  and  S-bits 
per  pixel,  the  encoder  used  in  this  paper  [6,7]  automat¬ 
ically  en  .odes  this  image  (using  an  equivalent  set  of  16 
transformations)  with  the  resulting  compression  equal  to 
35b;i. 

3.  ENCODING  AND  DECODING  AN  IMAGE 


The  question  that  must  be  answered  is,  given  an  im¬ 
age,  what  is  the  method  for  finding  transformations  that 

48.2.2 
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Figtire  3.  Ao,Ai,A2,A3,At,Ai,A6,  and  Aco  foi  the  16 
transformations  listed  in  the  text. 


encode  it?  The  contractive  mapping  fixed  point  theo¬ 
rem  suggest  how  to  answer  this  question.  The  contractive 
mapping  fixed  point  theorem  guarantees  that,  if  F  is  a 
comp/ete  metric  space,  and  the  map  fV  :  F  F  is  a  coit- 
iraciive  transformation,  there  exists  a  unique  fixed  point 
\W\  =  W^^'iAo),  for  any  /lo  €  F. 

Since  the  limit  set  is  a  fixed  point, 

IW^I  =  fy(HVi)  =  u^idlVl)  U  ■  ■  •  U  u;„(|lV|).  (2) 

This  formula  suggests  how  one  would  seek  the  transfor¬ 
mations  which  encode  a  given  image.  The  goal 

is  to  have  the  fixed  point  \W\  approximate  the  desired  im¬ 
age  /.  The  transformations  should  therefore  be  chosen  to 
satisfy  equation  2  with  \W\  replaced  by  /,  i.e.,  the  trans¬ 
formations,  when  applied  to  /,  should  result  in  /.  The 
•  •  •  1  ^n(f)  are  said  to  cover  the  image  /.  Refer¬ 
ring  back  to  the  two  examples  in  the  previous  section,  it 
is  seen  that,  given  the  Sierpenski  triangle,  or  the  fractal 
square  pattern,  by  satisfying  equation  2  the  transforma¬ 
tions  encoding  these  images  could  be  found. 

In  the  two  examples,  the  covering  W{f)  is  exact.  Given 
an  arbitrary  set  /,  it  is  not  possible  in  general  to  exactly 
cover  /  with  a  finite  number  of  transformations  of  itself. 

48.2.3 


1  h--  I.hvious  !S  then,  wli.il  li:ip jit  it.s  if  ihr  covt^r 

"ig  It  (/)  IS  ajtproxiruatc  ^  A  forulhiry  of  the  rontractivi* 
iiiapjuiig  fixed  point  lh«-oretii,  wiiith  HanisUy  rails  the 
Collage  Iheorctii,  puts  a  hound  on  the  error  fietween  jWj 
and  /  when  lt'(/)  does  not  exactly  equal  /  The  tlKsjrein 
says  that  the  clow-r  the  covering  tC(/)  is  to  the  original 
s<'t  /.  the  closer  tin-  fixed  (loint  |U'l  will  he  to  /,  and  that 
this  is  especially  true  if  the  traii.sfortnatiori.s  composing  tV' 
are  very  coiilractivc. 

In  figure  4  part  of  the  encoding  proceas  for  this  image 
is  illustrated.  The  figure  demonstrates  how  one  .section 
of  the  image,  called  a  range  (Hi),  is  covered  as  closely 
as  possible  by  applying  a  transformation  w,  to  a  domain 
(D,).  To  complefe  the  encoding  process,  a  ti',  and  D, 
must  be  found  to  cover  each  Hi,  and  the  /i,’s  must  com¬ 
pletely  tile  the  image.  To  facilitate  compact  specification 
of  the  transformations,  the  seLs  from  which  D’a  and  R'a 
are  chosen  arc  restricted  to  be  geometrically  simple,  auid 
limited  in  number.  The  le.’s  must  be  chosen  such  that 
upon  iteration,  a  fixed  point  is  reached.  In  light  of  the 
collage  theorem,  it  is  surprising  that  when  the  map  W 
is  constructed,  it  is  not  necessary  to  impose  any  contrac- 
tivily  conditions  on  the  individual  transforms.  The  nec¬ 
essary  contractivily  requirement  is  that  W  be  eventually 
contractive  [8J.  A  map  iV  :  F  -*  F  is  eventually  contrac¬ 
tive  if  there  exists  a  positive  integer  m  such  that  the  m*'* 
iterate  of  W  is  contractive.  Note  that  in  the  gray  scale 
example  of  section  2,  half  of  the  transformations  are  not 
contractive  in  the  z  direction. 

As  shown  for  the  simple  examples  in  the  previous  sec¬ 
tion  (figures  2  and  3),  decoding  an  image  is  performed 
by  starting  with  an  arbitrary  initial  image,  and  iterating 
the  transformations  until  the  fixed  point  is  reached.  This 
process  is  shown  for  an  encoding  of  “tank  farm”  in  fig¬ 
ure  5.  The  compression  of  this  image  was  8.66:1,  and  the 
PSNR  =  -2Qlog(^V|f°'’)  =  33.6  dB. 

4.  APPLICATION  TO  DTED 

Application  of  image  compression  to  geographic  map 
data  is  of  particular  interest  to  the  Navy.  Geographic  map 
data  comes  in  a  variety  of  formats,  and  there  has  been  ex¬ 
tensive  work  done  in  compressing  map  databases  for  vari¬ 
ous  applications.  In  this  section,  the  problem  of  compres- 
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Figure  4.  Part  of  the  encoding  process. 


sion  of  the  Digital  Terrain  Elevation  Database  {DTFT)) 
is  addressed.  The  database  consists  of  elevation  data  for 
a  grid  of  longitude-latitude  coordinates  where  the  grid 
points  are  roughly  100  meters  apart.  A  complete  descrip¬ 
tion  of  DTED  can  be  obtained  from  the  Defence  Mapping 
Agency.  Alward  and  Nicholls  [9]  examined  hierarchical 
data  structures  as  applied  to  DTED.  The  data  structures 
result  in  some  data  compression,  although  compression 
was  not  the  primary  goal  of  that  investigation.  In  this 
section,  the  problem  of  interest  is  evaluating  the  perfor¬ 
mance  of  the  iterated  transform  (IT)  method  on  DTED 
This  would  be  useful  for  applications  where  data  compres¬ 
sion  is  the  primary  concern,  and  issues  such  as  hierarchical 
structure,  access  time,  and  decoding  time  are  of  secondary 
importance.  As  a  means  of  evaluation,  the  DTFJD  images 


Figure  5  Initial  image,  first  iterate,  w-rond  iterate,  and  leritfi 
iterate  for  an  encoding  of  the  "tank  farm”  image 


were  also  conipri'ssi’d  wilti  .an  .MX'  !'  and  mean  resitiual 
vector  qiianlization  algorithm  {.VlfU’Q)  DEFil)  data  can 
be  thought  of  in  terms  of  a  gray  scale  image  where  the  lon¬ 
gitude  ami  latitude  identify  the  pixel,  and  the  elevation  is 
the  pixel  value,  F'or  the  purpose  of  possible  Navy  applica¬ 
tion.  the  data  were  transformed  from  tlieir  original  linear 
scale  between  0  and  10000  meters,  to  a  logarithmic  scale 
between  0  and  255.  The  quantization  results  in  a  compres¬ 
sion  from  H  bits  per  datapoint  to  8  bits  per  dalapoint. 
This  logarithmic  scale,  shown  in  figure  6.  represents  lower 
elevations  more  accurately  than  higher  elevations,  the  ra¬ 
tionale  for  this  being  that  lower  elevations  areas  are  more 
likely  to  be  important  for  Navy  applications,  and  that 
nearly  all  of  the  earth's  surface  (particularly  near  coast¬ 
lines)  is  at  relatively  low  elevation. 

The  iterated  transform  algorithm  was  identical  to  that 
used  in  reference  6  except  for  one  significant  modifica¬ 
tion  The  algorithm  was  modified  to  encode  sections  of  the 
image  on  coastlines  with  increased  accuracy.  F’or  image 
sections  that  contained  coastline,  the  error  criteria  was 
tightened.  This  resulted  in  more  segment  ation,  and  there¬ 
fore  higher  fidelity  in  these  areas  'I'he  ADt’  T  algorithm 
used  was  similar  to  that  described  by  Chen  and  Pratt 
[10).  except  for  a  modification  similar  to  that  described 
above  for  the  iterated  transform  algorithm.  To  encode 
coastlines  more  acriuately.  the  derision  level  quantizer 
table  was  rompres-si'd  for  .sections  of  the  image  on  coast¬ 
lines.  Improving  the  fidelity  at  the  ro.astlines  resulted  in  a 
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Figure  6  .  The  scale  transformation  used  for  the  DTED  data. 


decrease  in  the  overall  compression-fidelity  performance. 
The  MRVQ  algorithm  used  was  based  on  the  method  de¬ 
scribed  by  Linde  et  al.  (11].  Codebooks  were  generated 
from  2  sections  of  DTED  similar  to  (but  not  including) 
the  section  tested. 

The  compression  methods  being  considered  are  prop¬ 
erly  applied  to  images  with  a  dynamic  range  appropriate 
to  the  number  of  bits  used  to  store  the  image,  and  a  rel- 
atively  smooth  distribution  of  gray  level  intensity  values 
over  this  range.  In  figure  7,  the  512  x  512  section  of 
DTED,  which  is  1*  east  of  that  section  shown  in  figure  8 
is  displayed  where  sea  level  is  displayed  in  gray,  all  pixels 
with  an  elevation  that  is  a  multiple  of  100  (±  2)  are  dis¬ 
played  in  black,  and  all  other  elevations  being  displayed  in 
white.  It  appears  that  the  majority  of  the  data  contained 
in  this  section  of  DTED  were  created  from  100-meter  con¬ 
tour  maps,  the  result  being  that  a  disproportionate  num¬ 
ber  of  datapoints  are  at  multiples  of  100  meters.  In  the 
southeast  corner  of  the  map,  it  can  be  seen  that  the  num¬ 
ber  of  points  at  100  meters  are  far  more  dispersed.  In 
this  section  of  the  map,  roughly  the  number  of  datapoints 
that  would  be  expected  based  on  random  elevations  are 
present.  The  biased  quantization  illustrated  by  figure  7  is 
evident  in  other  sections  of  DTED. 

Because  of  this  biased  quantization,  the  reconstructed 
image  resulting  from  compressed  encodings  will  have  a 
smoother  distribution  of  pixel  values  than  the  original  im- 


Figure  7  .  Pixels  of  value  100  ±2  meters 


Table  1  .  ResoiU  for  encodings  of  figure  «. 


Method 

Compression 

FS.NR{dB) 

Figure 

IT 

44.86.1 

32.98 

IT 

21.49:1 

35.08 

9 

ADCT 

47.33:1 

30.51 

ADCT 

21.08:1 

34.92 

10 

MRVQ 

32.00:1 

31.36 

11 

Table  2  .  Result*  for  encodings  of  figare  1 2. 


Method 

Compression 

PSNR(dB) 

Figure 

IT 

104.11:1 

34.09 

IT 

42.08:1 

39.03 

13 

ADCT 

75.53:1 

33.86 

ADCT 

43.76:1 

38.14 

14 

MRVQ 

32.00:1 

36.93 

15 

age.  This  will  lead  to  an  artificially  poorer  measured  fi¬ 
delity  of  the  encoded  images.  In  areas  of  DTED  where 
the  data  are  “properly”  digitized,  this  situation  will  not 
occur.  Because  not  all  the  data  are  quantized  at  a  course 
resolution,  but  only  moet  of  it,  taking  advantage  of  this 
quantization  is  not  simple.  Although  it  has  not  been  done 
here,  before  applying  lossy  compression  techniques  such  as 
iterated  transforms,  ADCT,  or  VQ,  requantization  of  the 
data  in  such  a  way  that  the  majority  of  the  datapoints  re¬ 
tain  their  correct  values  could  result  in  overall  improved 
performance. 

Tests  in  this  section  were  performed  on  two  512  x  512 
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sections  located  in  the  fiords  of  Norway.  These  test  images 
are  shown  in  figures  8  and  12.  Figure  8  covers  the  section 
of  the  earth  from  5®  to  5®  25.6’  east  and  61®  to  61®  12  8’ 
north  and  figure  12  the  section  from  6°  to  6°  25.6'  east 
and  61®  12.8’  to  61®  25.6’  north.  These  sections  of  DTED 
were  chosen  because  the  topology  of  the  fiords  served  as  a 
severe  test  of  the  fidelity  of  the  compression  methods  A 
more  thorough  study  where  a  broad  area  of  the  database 
containing  a  representative  amount  of  flat  and  mountain¬ 
ous  regions  would  be  necessary  in  determining  the  com¬ 
pression  and  fidelity  possible  for  the  complete  database. 

Figures  9,  10,  and  11  show  gray  scale  reconstructed 
images  from  typical  encodings  of  the  image  in  figure  8  us¬ 
ing  iterated  transforms,  ADCT,  and  MRVQ  respectively. 
Similarly,  figures  13,  14,  and  15  show  reconstructed  im¬ 
ages  from  encodings  of  the  image  in  figure  12.  The  fidelity 
and  compression  of  these  (and  other)  encodings  are  sum¬ 
marized  in  tables  1  and  2. 

5.  CONCLUSIONS 

The  compression  versus  fidelity  results  indicate  that  the 
iterated  transform  algorithm  performed  well  when  com¬ 
pared  with  the  ADCT  and  MRVQ  methods.  Other  im¬ 
portant  factors  to  consider  when  comparing  these  com¬ 
pression  algorithms  are  access  time,  decoding  time,  and 
encoding  time.  Iterated  transforms,  along  with  ADCT  are 
variable  bit-rate  methods,  which  would  result  in  slower 
access  times  than  the  fixed  bit  rate  MRVQ  algorithm.  It¬ 
erated  tremsform  encoding  requires  an  extensive  search 
procedure,  making  it  slower  than  MRVQ,  which  also  re¬ 
quires  a  search,  albeit  a  shorter  one.  Iterated  transform 
encoding  is  also  slower  than  ADCT,  which  requires  only  a 
transformation  and  quantization.  For  most  applications, 
encoding  would  be  a  one-time  procedure;  therefore,  en¬ 
coding  time  would  not  necessarily  be  an  important  con¬ 
cern.  If  an  application  required  all  or  a  large  fraction 
of  DTED  be  encoded,  then  the  computer  costs  for  en¬ 
coding  become  significant.  For  many  applications,  the 
speed  of  decoding  an  image  might  be  a  critical  require¬ 
ment.  The  decoding  for  iterated  transforms  is  a  simple 
iteration,  making  it  faster  than  ADCT  (where  decoding 
takes  as  long  as  encoding),  and  slower  than  MRVQ,  which 
is  essentially  a  table  lookup. 
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